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Abstract —This paper introduces a novel algorithmic solution 
for the approximation of a given multivariate function by a 
nomographic function that is composed of a one-dimensional 
continuous and monotone outer function and a sum of univariate 
continuous inner functions. We show that a suitable approxima¬ 
tion can be obtained by solving a cone-constrained Rayleigh- 
Quotient optimization problem. The proposed approach is based 
on a combination of a dimensionwise function decomposition 
known as Analysis of Variance (ANOVA) and optimization over 
a class of monotone polynomials. An example is given to show 
that the proposed algorithm can be applied to solve problems in 
distributed function computation over multiple-access channels. 

Index Terms —Distributed computation, nomographic approx¬ 
imation, compute-and-forward, multiple-access channel 

I. Introduction 

Distributed function computation is not a new idea, however, 
there has been recently an emerging interest in communication 
strategies that use the wireless channel for function compu¬ 
tation over multiple-access channels HI, ||2|. Such strategies 
have the potential for huge performance gains expressed 
in terms of efficiency, complexity and signaling overhead. 
The approach of ||2| exploits the superposition property of 
the wireless channel for computation of some nomographic 
functions and is of high practical relevance because it is 
robust under practical impairments such as the lack of syn¬ 
chronization; in addition, there is little need for coordination 
between different sensors. The recent work H demonstrates 
a hardware implementation of the method presented in 121. A 
key ingredient thereby is that the function to be computed has 
a suitable nomographic representation that is used to match the 
process of function computation to the communication channel 
Q, 0. However, in ID, El, 0 an algorithmic method to 
obtain suitable nomographic representations of given functions 
is missing. 

The theoretical analysis of functions that can be written in 
a nomographic form has a long history, which dates back to 
Kolmogorov 0, Sprecher 0, 0 and Buck 0. The authors 
in 0 and 0 showed that every function has a nomographic 
representation, when the outer function can be discontinuous 
(e.g. a space-filling curve). Implementing such functions in 
digital signal processing systems, especially when based on 
space-filling curves as analysed in 0, is prohibitive and leads 
to designs that are notoriously susceptible to noise. 

Since many known nomographic representations of func¬ 
tions used in wireless sensor network (WSN) applications 


are too intricate for practical use, we argue in favor of 
considering nomographic representations that approximate the 
functions of interest and are easy to implement in distributed 
networks Q In this work, we propose a simple method for 
nomographic approximation that can be used to approximate 
functions dictated by given applications in a decentralized 
manner. In a nutshell, the proposed method can be used to 
approximate certain multivariate functions by means of some 
nomographic approximations that are composed of a monotone 
continuous outer function and continuous inner functions. 
When implemented in digital signal processing systems such 
inner and outer functions are more robust to noise and easier 
to compute with finite precision arithmetic. The proposed 
approach is an algorithmic way to approximate more functions 
to be computed via the methods in 0, 0. 

A. Notation 

Scalars, vectors, matrices and sets are denoted by lowercase 
a, bold lowercase a, bold uppercase A and calligraphic letters 
A, respectively. (•)^, fog and f~^ stand for transpose, 
function composition and function inverse. The sets of natural 
number, real numbers and D x D real symmetric matrices 
are denoted by N, M and while 0 is used to denote 

the vector of all zeros, where the size will be clear from the 
context. We use Cp{X^) to refer to the space of p-integrable 
(1 < p < c») real functions, Vk,d{X^) to the space of square 
integrable real polynomials of K variables of degree at most D 
in each variable, C{X^) to the space of continuous functions, 
and J\f {X^^ to the space of nomographic functions, i.e. func¬ 
tions that can be represented in the form ‘Pk{xk)), 

and all spaces defined on X^ := X x ... x X C 
respectively. If in addition the outer and inner functions tjj 
and ipk fulfill ipk G C{X) : X ^ Q M. \/k and 
tp G C(H') : O' —H, we denote the corresponding space 
hyAfciX^) 0 . 

H. System Model, Problem Statement and 
Theoretical Framework 

We consider a network consisting of iT G N sensors 
indexed by the set K. := iT}. The sensors observe 

’Notice that nomographic approximation is used in this paper to refer to a 
nomographic representation of some function that approximates the function 
of interest in some pre-dehned sense. 




Fig. 1. Separation and superposition based function compuation in networks. 


measurements x := [xi,..., xkY' G and the task of the 
network is to compute or approximate a multivariate function 

f : ( 1 ) 

at some pre-selected fusion nodeH The underlying computa¬ 
tion and communication scenario is illustrated in Fig. [T] 

A. Problem statement 

It was shown in El, 0 that a nomographic represenation 
of some given function admits an efficient reconstruction 
or estimation of this function over the wireless channel. 
Although every function has a nomographic representation 
of the form tp{^k=i fkixk)), the general construction of 
the inner functions and the outer function is not amenable 
to implementation on state-of-the-art hardware technologies. 
Therefore, given some function / defined by O, the problem 
is to find a suitable nomographic representation such that (in 
some sense) 

/(a;) Ri (2) 

In doing so, we assume the following: 

A. 1 tfj is monotone continuous, 

A.2 {ipk}kGJC are continuous, 

which is based on practical considerations concerning noise 
robustness and implementability on digital signal processing 
systems. To emphasize the importance of Assumptions A.l 
and A.2 on the space of representable functions, let us review 
some previously known results from literature. 

Fact 1. Representation by nomographic functions 

1) Let ipk be monotone increasing, ip be possibly discontin¬ 
uous, then we have Bl, Ml/ 

f{x) =ip(^^^_^Pk{xk)j ,\/f eC{x^). ( 3 ) 

2) Let ifk G C(R), V' G C(R), then the following holds: 

• J\fc{X^) is a nowhere dense subset ofC{X^) 

• fix) = 'fpi{Y)k=iPk\xk)), i-e. every 

function can be written as a sum of at most 2K -f 1 
nomographic functions Ml/- 

Due to Assumptions A. 1-2, we consider the second case and 
impose an additional constraint of a single function ip. Note 
that the implication of Fact 1.2) is that only a sparse subset 

^Due to the structure of the nomographic representation, every ordinary 
sensor can act as fusion node. 


of functions / € C{X^) can be approximated by some / S 
ffciX^) with arbitrary high precision. The derived framework 
provides a necessary condition for suitable / to exist as well 
as the corresponding inner and outer functions. 

B. Theoretical framework: Analysis of variance 

To establish our results, we resort to a general frame¬ 
work for a dimensionwise decomposition of a function f{x) 
into a sum of lower-dimensional terms. More precisely, we 
consider the Analysis of Variance (ANOVA) framework ||9|, 
m, which has also been considered in the context of many 
other applications, ranging from chemistry and finance to 
statistics (see e.g. im). The goal of this framework is to 
decompose a function / G C 2 i^^) into a sum of 2^ 
functions fs that are mutually orthogonal w.r.t. the inner 
product (/, p) = fix) ■ g{x) dx. Here, the function / 
is decomposed into a sum of lower dimensional functions 

fix) = '^^^^fsixs), ( 4 ) 

where each function fs only depends on a subset of variables 
indexed by the set 5 C /C and the sum ranges over the power 
set of 1C. The algorithm to obtain the ANOVA decomposition 
for a given function / € L^i^C^) is given by Alg. [T] 


Input: / G L^iX^) 

Output: functions {fs}scic, variances a^, 
h ■= IxK fix) dx-, 0-0 := 0; 
for 5 c /C, 5 7 ^ 0) do 

fsitxs) ■= fp^K-\s\ fix) dxiQ\s ~ Swcs fuitxu)i 
■= fUxs) dxs-, 

end 

— Lx Pix) dx - iJ^i, f{x) dxf = Y.s<zk 

Algorithm 1: ANOVA decomposition of f II 101 . 

Remark 1. Despite its simple form, the reader should note 
that a numerical implementation of Alg. Q] is in general not 
trivial. In fact, the computation of all 2^ terms for a full 
decomposition becomes impracticable for moderate values of 
K and the involved high-dimensional integrals need to exist 
and be well-defined. In addition to these requirements, the 
integrals might still be hard to obtain in analytical form 
and numerical approximation methods might be necessary. 
However, for some classes of functions including multivariate 
polynomials f G Vk,d{X^), we can easily obtain a truncated 
decomposition in closed form up to moderate values of K. 

The conditions for which a truncated decomposition pro¬ 
vides a good approximation of the original function are made 
precise in the following definition. 

Definition 1. 4701? A function f is said to be of order d if 
f{x) = fsi^s) ^ cr| = (5) 

^ ' 2--^\S\<d ^ ' ^\S\<d * 

and of effective superposition dimension d if, for some given 
sufficiently small e > 0, there holds 

T.^si<d^S>{^-e)a^ ( 6 ) 




where fs{xs), <J% and are obtained by Alg. [7] 

In particular, if a function / is of order one, there are 
no interactions between variables {xk]k^K there exists 
a parametrization of the form f{x) = X]| 5 |<i= 
+ /o (resp. small interaction and ^proxi¬ 
mate parametrization for effective superposition onqj. This 
parametrization is well-known in the literature as Generalized 
additive models Km, ini and it forms the basis for the 
algorithm developed in the following. 

III. Outline of the proposed approach 

To obtain a parametrization of type (|2]i the idea is to ’’skew” 
the function / with a bijection g G C : ft ^ ft', such that the 
resulting function (p(x) := {g o f)(x) : A" O' is of order 
one according to Def. [T] Here, the required bijectiveness of g 
ensures that a unique functional inverse ip := g~^ € C exists 
and the resulting parametrization in nomographic form can be 
given by 

fix) = {g~^ogof)ix) = ■ (' 7 ) 

Accordingly, given some approximation constant e > 0 in 
Def.n a nomographic approximation is obtained if ifiix) is 
of effective superposition dimension 1. These statements are 
summarized in the following Lemma. 

Lemma 1. Let f : ft G C{X^) and g : ft ^ 

ft' G C{ft) be a bijection with inverse ip := g~^. If 
<p{x) := {g o f){x) G C 2 {X^) is of order 1, we obtain a 
nomographic representation / = '0(X]|5|<i 
ANOVA decomposition of pix) and the identity (ip o g o 
f)(x) := (ip o Lp)(x) = ip(Y.\s\<i'Psixs))- Similarly, if 
(g o f)(x) is of effective superposition dimension 1 (given 
e > 0), we obtain a nomographic approximation, where the 
approximation is optimal in an C 2 sense (or, equivalently, the 
minimum variance sense) of the inner approximation problem 

SMS- 

Remark 2. The reader should note however, that due to 
the usually nonlinear transformation by the outer function 
Ip, this £2 optimality does not need to hold for the overall 
approximation error. Note also that bounding the resulting 
supremum norm of the approximation error sup^, 

S| 5 |<d £‘S(^‘S)I interesting open problem but out of 

scope of this implementation-oriented paper. We highlight, that 
the required analysis seems to be quite challenging but might 
be similar in spirit to the simpler case of approximation with 
ridge functions (see e.g. 1 1751/ ). 


Lemma 2. Let g(^) G 'Pi,d-i ■= Y.d=o be 

a real polynomial of degree D — 1 defined on [0,1] with 
real coefficients z := [zi,...,Z£)] and M G be 

a lower triangular matrix with entries given by [M]ij = 
(jZi) (^-/) — j’ = 0 Vi < j. Then it holds 

that 


min [Mz]i < g(^) < max [Mz]i. (8) 

i i 

As a continuous function on a closed interval is bijective iff 
it is strictly monotone, we may obtain a suitable set of bijec- 
tions by bounding > 0 V ^ G [0,1] and integrating the 
polynomial g(^) w.r.t. A formal proposition characterizing 
the resulting set is given in the following: 

Proposition 1. Let g(^) G Vi^d be a polynomial in ^ of degree 
D defined on [0,1]. Then, g(^) is monotone and continuous 
on [0,1] if it holds that g G V(z, c) C with 



z£Z\o}, 

(9) 

Z := {z\Mz > 0}, 


(10) 

M := Mdiag([l,2,3,. 


(11) 


Proof. The proof follows from Lemma |2] by integration and 
comparison of terms. □ 

The reader should note, that Z defines a polyhedral cone 
(where the origin needs to be excluded to ensure strict mono¬ 
tonicity for the set V(z, c)) which makes it viable for practical 
optimization algorithms. 


B. Nomographic Approximation by cone-constrained Rayleigh 
quotient optimization 

It remains to study the structure of the inner approximation 
problem. To this end, let g G Pi,d L> V(z, c) with domain 
fl := [0,1]. Then we can establish the following results on 
and (T^. 

Lemma 3. Let g G Vi^d, P '■= (g ° f) ^ 7^2([0, l]'^) and f : 
[0, 1]^ —>• [0,1]. Then, it holds that ^^(x) dx — 

(7at^ can be written in quadratic form = 
z^Bz with B := - 6 ( 2 ) 5 ( 2 ).t g 

\B^^\j := [ dx, := f /(a:)* dx 

7 [ o . i]k dlOAW 

( 12 ) 


which is independent of the chosen constant a 

Proof. The proof is deferred to Appendix lAl □ 


A. A class of monotone polynomials 

To obtain a computationally tractable set of continuous 
bijections g, we consider a class of polynomials known as 
Bernstein polynomials'. 

^For convenience, subsequent designations on the approximate case are 
only made when a precise distinction is necessary. 


Lemma 4. Let g, p and f be as in Lemma\^and let the mixed 
integrals 

:= [ f [ f(xy dx,c\k (13) 

7[o,i] 

X / f(xy dXfCKk ) dxk WkinK. 

J[o,iW-^ J 



exist and be finite. Then, we have A^z \/k G 1C with 

Ak := (fc) - 6(2)5(2 ),t g ^^4^ 

Proof. The proof is deferred to Appendix |B] □ 

Now, with the above results in hand, we are in a position 
to state our main result. 


Proposition 2. Let g, (p and f be as in Lemma |5] Then, given 
some e > 0 , a function f : [ 0 , 1 ]^ —>■ [ 0 , 1 ] has a nomographic 
approximation in accordance with Def. Q] and Rem. |2] with 
continuous and monotone outer function f and continuous 
inner functions pu if and only if 


(l-£)< 


z^Az 


max 


zg2\o z'^Bz^ 


(15) 


where A = the matrices Ak, k = 1... K, B 

and M are given by Lemma |5] |4] and Prop. [7] 


Proof. Let e € (0,1) be given and arbitrary. Then, by Def. [T] 
and (l7]i, a function / has a nomographic approximation if (and 
only if) there exists g G V(z) such that p(x) = (g o f ){x) 
is of effective superposition dimension 1 and (1 — er) < i? := 
SZ|s^<i yvhere both cr| = a^, k = 1... AT, and 

depend on 2 ; 7 ^ 0 (through the skewing function g). Now 
taking the maximum (which exists) of i? = R{z) over all 
z G 2 \ 0, and by considering Lemma [3 and Lemma |4] with 
0 , we can conclude that / has a nomographic approximation 
if and only if (fTSl) holds. □ 


By applying the matrix lift Z := one can show 

that if the matrix B is nonsingular, the optimization problem 
outlined in Prop. |2] can be recast as the generally nonconvex 
optimization problem 


Z* G argmax tr {AZ} (16a) 

s.t. tr {BZ} = (5 (16b) 

Z h 0 (16c) 

[MZM'^]ij>0y {i,j}GlC^ (16d) 
rank(Z) = 1. (16e) 


Due to the high complexity of solving (fThl l directly, we apply 
a technique known as semidefinite relaxation iflTl to the 
nonconvex semidefinite program by neglecting the (nonconvex) 
rank constraint (I16el i first and solving the resulting convex 
SDR Then, the candidate solution set for the original problem 
(fThl l is given by {±i/Xi'qi}, where Ai and qi denote the 
largest eigenvalue and eigenvector of Z*. The relaxation is 
tight, i.e. the solution z* G 2 n {2\/Xiqi\ of the SDR 
coincides with the solution to (fThl) if rank(Z*) = 1 . If the 
rank constraint is violated, the solution will in general be 
suboptimal. In this case, we apply a heuristic to obtain a 
suboptimal feasible solution 

a?) 


where (•)+ denotes the projection onto the positive orthant. 
This heuristic is motivated by our simulation results, which 
show that applying the projection onto the positive orthant 
after transformation by the matrix M yields a numerically 
much more stable solution compared to applying the projection 
directly by computing 

z* e argmin|^|jy^,,>o|min{||z - ^/>^ql\\l, ||z + \Aigi|| 2 }- 
The resulting overall approximation algorithm is described in 

Alg.|20 


Input: / : [0,1]^ ^ [0,1], D 

Output: {¥>5}|5|<i^ V' 

(1) Compute A, B, M using Lemma [3 E] and |2j 

(2) Compute z* by solving SDR of (O; 

(3) Compute ANOVA for g* o f with g* := V(z*); 

(4) Compute ip using numerical inversion of p*; 

Algorithm 2: Approximation of / by V'(^| 5 |<i Ps{xs)). 

IV. Numerical Results 

To evaluate the performance of the proposed algorithm, we 
simulate a network with two sensors measuring xi and X 2 and 
one fusion node (see also Fig. [T]) that are deployed to compute 
a desired function 

1 2 

f{xi,X2) = -{xi+XiX2+X2) (18) 

in a distributed manner. We choose the given example to 
highlight the impact of optimizing the skewing function to 
cancel out the interaction among the variables through the 
product X 1 X 2 , where it is rather counter-intuitive that a good 
nomographic approximation exists. Many other interesting 
functions may come from data-driven models for distributed 
regression/classification using polynomial kernels which is 
beyond the scope of this paper. Using the algorithm described 
in Alg. 13 with skewing function g of degree D = 20, we find 
a nomographic approximation with objective value e = 10“^. 
The resulting functions and the overall approximation error are 
given in Table HJ and Fig. |3 where it can be seen in Fig. |3f) 
that the resulting overall approximation error is bounded by 
l/~^(S| 5 |<i‘F<s)l — 6x10“^. To highlight the effectiveness 
of using an optimized skewing function, the variances for 
approximation by a purely additive model (i.e. without using a 
skewing function) are given in Table |I] as a reference resulting 
in an objective value of e = 0.12. 

the spirit of reproducible research, the corresponding MATLAB imple¬ 
mentation of Alg. [ 2 ] will be made available on one of the authors websites. 


= 0.0168 

0 -^ 2 ^ = 0.0168 

2 | — 0.0043 

0-2 = 0.038 

(^{ 1 } +‘^ 12 })'"“^ = 0-88 

£ = 0.12 


Direct evaluation of Alg.|T]for given test function. 
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Fig. 2. Simulation results for Alg. [2 applied to the function / = 

i {Xl +XIX2 +X2)^. 


V. Conclusion 

In this paper, we studied the problem of nomographic 
approximation with continuous monotone outer function and 
continuous inner functions, which we expect to be of high 
practical relevance due to the amenability for distributed 
computation. By using a computationally tractable class of 
bijections based on Bernstein polynomials, we obtain a nomo¬ 
graphic approximation with prescribed properties with respect 
to a defined distortion metric. The optimized approximation is 
obtained by the maximization of a cone-constrained Rayleigh- 
quotient. Since the problem is nonconvex, we consider its 
semidefinite relaxation. Though a precise characterization of 
the class of functions approximable in nomographic form 
with prescribed error metric still remains an open problem, 
we can see some interesting applications of the presented 
results, ranging from distributed learning and optimization to 
compressed classification. 
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Appendix A 

Proof of = z^Bz, independence oe c 

To show that is independent of c let p G Vi^d, 'fix) := 
{g ° f)[x) G C 2 {X^), ■■= [0,1]^ and / := }[x). Then 

, 2 


^2 = 


IXX 


'^[xY dx — 


A(1) 


if K 
'-- 

A(2) 

2 




= ( I, Ell 

- L & X- ‘‘‘ '‘4 


As 2 ; is independent of / it follows by comparison of terms 
that 0-2 = z'^Bz with B := - 6 ( 2 ) 5 ( 2 ).t 


Bf := / r+^ dx, 

J[0,1]X 

bf-.= [ f dx, 

Jl0,l]X 














































Appendix B 

Proof of a1 = Auz, independence of c 

To show that tr^ = z'^A^z is independent of c let g, ip{x), and / be as in Appendix lAl Then 

o'fe = / ( / ‘^{x) - / if{x) dx ] dxk 

Jx \JxK-^ Jx‘< J 

^ Zdf + dxK,\k - J ^ dx 

^ Zdf‘^ + c j dxic\k dxk+ I I 

\d=l / J 

/ D \ 


lx 


lx 


IxK-i 


tXX-^ 


lx 


dxk 


Ix^ 


d=l 


' c I dx 


dxk 


- 2 


lx 


D 


lx” 


Y,Zdf + c\ dX;c\k- / ^ 

\d=l ' ^ '■''-1 


dx 


^d=l 


dxk 


/ D \ r D “I 

f \ f dx^\k ] dxk +2c [ f V Zdf‘^ dx,c\k dxk + f 

Jx\JxK-”^. / Jx JxK-”‘^, Jpc 


d=l 
D 


dxk + / c dxk 

IX 


D 


- 2 


lx XJxx-” 


^ Zdf‘^) dx^\k ■ / X! ^df'‘ dx dxk 


d=l 


XK 


d=l 


- 2c 


+ 


J ^ ^ Zdf‘^ dx/c\k"j dxk-2cj ^ X! ^df‘^ dx^ dxk 

X! dx dxk+ dxk, 
d=i / dx 


— 2 (? dxk 


lx \Jx”< 


X ^df'^ dx dxk + 2c 


A \Jx”< 


where we note that all terms involving c vanish and we obtain 

2 


CTfc = 


Ix It ^ Ix (X-- It ■ 


D 




X ^df^ dx dxk 


d=l 


r(i) 


r(2) 


j ^ Xj ^df‘^ dx^ dxk, with 


r(3) 


= 


D D 

i=i j=i 


X \JxK-” 


r dx,c\k 


Ixx-” 


( D D 

EE 

i=l J = 1 

dxic\k ) dxk, 


[ ([ dx^\^ dxk= [ [EE^*^J / fdx^\k-[ P dxic\A dxk 

Jx\Jx”<-”^^ J Jx”<-” Jx”<-” 


which can be written as T^^^ = z"^A^^\k)z with [A^'^\k)\ij := Jxifx’^-iP dxiQ\k ‘ dx^i^^k) dxk- For the 

remaining terms we compute 


r(2) = 


lx \Jx”<- 


Ix \Jx”<- 


'^Zdp^ dxk:\k- [ '^Zdf^^ dx\ dxk = [ ( [ Zdp^ dxK:\k ■ z] dxk 

^ Xt dx”< J Jx \Jx”<-” J 

X ^df‘^ dx!c\i^ dxk ■ P^'>’'^Z = [ X ^df^ dx ■ b^'^^''^z = z'^b^^'>z 


d^l 




— 


D 


Ix \Jx^ 


Zdf^ dx dxk = 




fx 


D 


D 


Zrf / f'^dx-'^Zd dx\ dxk = z 




Ix^ 




Ix^ 


and obtain the desired result cr^ = z^ z = z^ A^^\k) — 


2 . 






















